Continuous energy demodulation methods and application to speech analysis

Authors

  • Dimitrios Dimitriadis
  • Petros Maragos
Abstract

Speech resonance signals appear to contain significant amplitude and frequency modulations. An efficient demodulation approach is based on energy operators. In this paper, we develop two new robust methods for energy-based speech demodulation and compare their performance on both test and actual speech signals. The first method uses smoothing splines for discrete-to-continuous signal approximation. The second (and best) method uses time-derivatives of Gabor filters. Further, we apply the best demodulation method to explore the statistical distribution of speech modulation features and study their properties regarding applications of speech classification and recognition. Finally, we present some preliminary recognition results and underline their improvements when compared to the corresponding MFCC results. © 2005 Elsevier B.V. All rights reserved.
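As a rough illustration of the second idea in the abstract (energy-based demodulation using time-derivatives of Gabor filters), the sketch below band-passes a sampled signal with a Gabor filter and obtains the derivatives of the filtered signal by convolving with the analytically differentiated filter. It then applies the standard continuous-time energy separation formulas built on the energy operator Ψ[x] = ẋ² − x·ẍ, namely ω ≈ sqrt(Ψ[ẋ]/Ψ[x]) and |a| ≈ Ψ[x]/sqrt(Ψ[ẋ]). This is not the authors' implementation: the function names, the parameters `fc_hz` and `alpha`, the kernel truncation at 4/alpha seconds, and the unit-gain normalization at the center frequency are all assumptions made for the example.

```python
import numpy as np

def gabor_derivative_kernels(fc_hz, alpha, fs):
    """Sample g(t) = exp(-alpha^2 t^2) cos(wc t) and its first three derivatives.

    Hypothetical helper: truncation at 4/alpha seconds and normalization to
    unit gain at the center frequency are choices made for this sketch.
    """
    wc = 2.0 * np.pi * fc_hz
    half = int(np.ceil(4.0 / alpha * fs))           # kernel half-length in samples
    t = np.arange(-half, half + 1) / fs
    h = np.exp(-(alpha * t) ** 2 + 1j * wc * t)     # complex Gabor, g = Re{h}
    u = -2.0 * alpha ** 2 * t + 1j * wc             # since h' = u * h
    derivs = [
        h,                                          # g
        u * h,                                      # g'
        (u ** 2 - 2.0 * alpha ** 2) * h,            # g''
        (u ** 3 - 6.0 * alpha ** 2 * u) * h,        # g'''
    ]
    # Discrete-time gain of the sampled filter at the center frequency.
    gain = np.abs(np.sum(h.real * np.exp(-1j * wc * t)))
    return [d.real / gain for d in derivs]

def gabor_esa(x, fc_hz, alpha, fs):
    """Estimate instantaneous amplitude and frequency (Hz) of one resonance band."""
    g0, g1, g2, g3 = gabor_derivative_kernels(fc_hz, alpha, fs)
    # Filtered signal and its time derivatives, via differentiated kernels.
    y0 = np.convolve(x, g0, mode="same")
    y1 = np.convolve(x, g1, mode="same")
    y2 = np.convolve(x, g2, mode="same")
    y3 = np.convolve(x, g3, mode="same")
    # Energy operator applied to y and to its first derivative.
    psi_y = y1 ** 2 - y0 * y2
    psi_dy = y2 ** 2 - y1 * y3
    # Clip to a small positive floor so the ratios stay finite at the edges.
    eps = 1e-12
    psi_y = np.maximum(psi_y, eps)
    psi_dy = np.maximum(psi_dy, eps)
    omega = np.sqrt(psi_dy / psi_y)                 # rad/s
    amp = psi_y / np.sqrt(psi_dy)
    return amp, omega / (2.0 * np.pi)
```

For a synthetic AM-FM tone near 1 kHz sampled at 16 kHz, calling `gabor_esa(x, 1000.0, 1500.0, 16000.0)` should track the amplitude envelope and instantaneous frequency away from the signal edges. The amplitude estimate is scaled by the filter gain at the instantaneous frequency, so it is only approximate when the frequency drifts from `fc_hz`.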

Similar resources

Rolling Bearing Fault Analysis by Interpolating Windowed DFT Algorithm

This paper focuses on the problem of accurate Fault Characteristic Frequency (FCF) estimation of rolling bearings. Teager-Kaiser Energy Operator (TKEO) demodulation has been applied widely to rolling bearing fault detection. The FCF can be extracted from vibration signals that are pre-processed by the TKEO demodulation method. However, because of the strong noise background of the fault vibration signal, it is...

Robust energy demodulation based on continuous models with application to speech recognition

In this paper, we develop improved schemes for simultaneous speech interpolation and demodulation based on continuous-time models. This leads to robust algorithms to estimate the instantaneous amplitudes and frequencies of the speech resonances and extract novel acoustic features for ASR. The continuous-time models retain the excellent time resolution of the ESAs based on discrete energy operato...

Continuous-time models for AM-FM signal demodulation and their application to speech recognition

Automatic speech recognition (ASR) systems can benefit from including in their acoustic processing new features that account for various nonlinear and time-varying phenomena during speech production. In this paper, we develop robust continuous-time expansions used to demodulate the instantaneous amplitudes and frequencies of the speech resonances and extract novel acoustic features from s...

Instantaneous Energy Operators : Applications To

The nonlinear energy operator $\Psi(x) = [\dot{x}]^2 - x\ddot{x}$ and its discrete-time counterpart have found numerous applications including development of the energy separation algorithm (ESA) for demodulating AM-FM signals, tracking speech modulations, and detecting various events in nonstationary signals. In this paper we first present some improvements on the energy operator and ESA when applied to demodulat...

AM-Demodulation of Speech Spectra and Its Application to Noise Robust Speech Recognition

In this paper, a novel algorithm that resembles amplitude demodulation in the frequency domain is introduced, and its application to automatic speech recognition (ASR) is studied. Speech production can be regarded as a result of amplitude modulation (AM) with the source (excitation) spectrum being the carrier and the vocal tract transfer function (VTTF) being the modulating signal. From this po...

Journal:
  • Speech Communication

Volume 48  Issue

Pages  -

Publication date: 2006